The VUB Blizzard Challenge 2010 Entry: Towards Automatic Voice Building
نویسندگان
چکیده
In this paper we describe the voices we submitted to the 2010 Blizzard Challenge, a yearly challenge to evaluate auditory speech synthesis on common data. One of the goals of a datadriven synthesizer, such as ours, is to generalize the speech database in such a way that it allows a realistic rendition of unseen input text. The two main changes to our system, compared to previous submissions, are the inclusion of an HMM-based acoustic prosody model, and the automatic training of context-dependent target cost weights. These weights are estimated for each individual target during synthesis, and depend on the linguistic features of these targets which encompass their broader linguistic context. Another new aspect of our synthesizer is the ability to synthesize Mandarin Chinese speech. Its evaluation helps us assess the quality of our synthesizer for languages unfamiliar to the voice developers. Evaluation results and possible improvements to our synthesizer are also discussed.
منابع مشابه
The VUB Blizzard Challenge 2009 Entry
In this paper we describe the voices we submitted to the 2009 Blizzard Challenge, a yearly challenge to evaluate auditory speech synthesis on common data. Since it is the second time we participate in this challenge, in this paper we focus on the changes we made to our unit selection-based system. The weighted sum of symbolic target costs has been replaced by a single statistical target cost; t...
متن کاملAn Overview of the VUB Entry for the 2008 Blizzard Challenge
In this paper, we describe the configuration of our synthesizer, as used for the Blizzard Challenge the first time. Two new UK English voices were built for the DSSP synthesizer, our in-house unit selection synthesizer, which uses non-uniform units and a symbolic description of target prosody. Listening tests indicate reasonable quality although there is still room for improvement.
متن کاملThe ILSP Text - to - Speech System for the Blizzard Challenge 2011
This paper describes ILSP and INNOETICS Speech Synthesis System entry for the Blizzard Challenge 2011 competition. A description of the underlying system and techniques used are provided, as well as information about the voice building process and discussion on the obtained evaluation results.
متن کاملThe AHOLAB Blizzard Challenge 2008 Entry
This paper describes the process of building unit selection voices for our participation in the Blizzard Challenge 2008. Out of the three voices required (15 hours UK English, 1 hour UK English subset and 6.5 hours Mandarin Chinese) we only built the English ones.
متن کاملExpressive Speech Synthesis for Storytelling: The INNOETICS' Entry to the Blizzard Challenge 2016
This paper describes INNOETICS' Speech Synthesis System entry for the Blizzard Challenge 2016, along with the corresponding results and some relevant discussion. We provide a description of the underlying system and techniques used in our TTS platform, as well as some detailed information regarding the voice building process. Based on the obtained results from the listening experiments, we atte...
متن کامل